Overview
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 118917 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 17.2 MiB |
| Average record size in memory | 152.0 B |
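The two memory figures in this table can be cross-checked against each other: the average record size multiplied by the number of observations should give back the total size. A minimal sketch of that arithmetic, using only the values reported above (the helper name `total_mib` is illustrative and not part of the report):

```python
# Values copied from the dataset statistics table.
N_OBSERVATIONS = 118917
AVG_RECORD_BYTES = 152.0


def total_mib(n_rows: int, bytes_per_row: float) -> float:
    """Return the total size in MiB (1 MiB = 1024 * 1024 bytes)."""
    return n_rows * bytes_per_row / (1024 * 1024)


# Rounded to one decimal place, this matches the reported 17.2 MiB.
print(f"{total_mib(N_OBSERVATIONS, AVG_RECORD_BYTES):.1f} MiB")
```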
Variable types
| Text | 6 |
|---|---|
| Numeric | 9 |
| Categorical | 2 |
| DateTime | 2 |
che_pc_usd is highly overall correlated with che_perc_gdp and 1 other fields | High correlation |
che_perc_gdp is highly overall correlated with che_pc_usd and 1 other fields | High correlation |
country is highly overall correlated with che_pc_usd and 4 other fields | High correlation |
insurance_perc_che is highly overall correlated with country | High correlation |
population is highly overall correlated with country | High correlation |
prev_perc is highly overall correlated with price_unit | High correlation |
price_month is highly overall correlated with price_unit | High correlation |
price_unit is highly overall correlated with prev_perc and 1 other fields | High correlation |
public_perc_che is highly overall correlated with country | High correlation |
price_unit is highly skewed (γ1 = 87.52046958) | Skewed |
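The skewness alert reports γ1, the standardized third moment of the distribution. As a reminder of what that number measures, here is a minimal sketch of the plain moment-based definition, γ1 = m3 / m2^(3/2). The profiling tool may apply a bias correction on top of this, so the sketch should not be expected to reproduce the reported value digit for digit. The function name and both sample lists are made up for illustration.

```python
from statistics import fmean


def skewness(values):
    """Moment-based skewness: third central moment over variance ** 1.5."""
    n = len(values)
    mean = fmean(values)
    m2 = sum((v - mean) ** 2 for v in values) / n  # biased variance
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    if m2 == 0:
        raise ValueError("skewness is undefined for constant data")
    return m3 / m2 ** 1.5


# Illustrative data only: a symmetric sample and one with a single large outlier.
symmetric = [1.0, 2.0, 3.0, 4.0, 5.0]
right_tailed = [1.0, 1.0, 1.1, 1.2, 500.0]

print(skewness(symmetric))     # zero for symmetric data
print(skewness(right_tailed))  # strongly positive: long right tail
```

A large positive value, as reported for price_unit, indicates that a few very large observations pull the right tail far away from the bulk of the data.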
Reproduction
| Analysis started | 2024-11-29 02:50:07.344272 |
|---|---|
| Analysis finished | 2024-11-29 02:50:17.806013 |
| Duration | 10.46 seconds |
| Software version | ydata-profiling v4.12.0 |
| Download configuration | config.json |
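The reported duration is simply the difference between the two timestamps. A short sketch of that calculation with the standard library, using the timestamps from the table above (variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2024-11-29 02:50:07.344272")
finished = datetime.fromisoformat("2024-11-29 02:50:17.806013")

# Elapsed wall-clock time in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # prints "10.46 seconds"
```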
Variables
brand
Text
| Distinct | 591 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 929.2 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 34 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | BRAND_354E |
|---|---|
| 2nd row | BRAND_626D |
| 3rd row | BRAND_45D9 |
| 4th row | BRAND_D724 |
| 5th row | BRAND_4887 |
| Value | Count | Frequency (%) |
| brand_0056 | 2366 | 2.0% |
| brand_62c7 | 1875 | 1.6% |
| brand_7a2e | 1762 | 1.5% |
| brand_a12a | 1761 | 1.5% |
| brand_4048 | 1642 | 1.4% |
| brand_076f | 1521 | 1.3% |
| brand_cfd9 | 1449 | 1.2% |
| brand_71fa | 1435 | 1.2% |
| brand_fcb2 | 1369 | 1.2% |
| brand_d724 | 1305 | 1.1% |
| Other values (581) | 102432 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 152540 | |
| B | 151293 | |
| D | 144448 | |
| N | 118917 | |
| _ | 118917 | |
| R | 118917 | |
| 6 | 36146 | 3.0% |
| 2 | 34638 | 2.9% |
| 0 | 33599 | 2.8% |
| 9 | 31823 | 2.7% |
| Other values (9) | 247932 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1189170 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1189170 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1189170 |
che_pc_usd
Real number (ℝ)
High correlation 
| Distinct | 396 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5293892 |
| Minimum | -1 |
| Maximum | 2.6569132 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 741 |
| Negative (%) | 0.6% |
| Memory size | 929.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1.05696 |
| Q1 | 1.1799313 |
| median | 1.4723783 |
| Q3 | 1.8164794 |
| 95-th percentile | 2.2862047 |
| Maximum | 2.6569132 |
| Range | 3.6569132 |
| Interquartile range (IQR) | 0.63654806 |
Descriptive statistics
| Standard deviation | 0.43909375 |
|---|---|
| Coefficient of variation (CV) | 0.28710399 |
| Kurtosis | 5.6968711 |
| Mean | 1.5293892 |
| Median Absolute Deviation (MAD) | 0.31148564 |
| Skewness | -0.6786595 |
| Sum | 181870.38 |
| Variance | 0.19280332 |
| Monotonicity | Not monotonic |
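Several of the che_pc_usd statistics above are derived from others: the range from the minimum and maximum, the IQR from the quartiles, the coefficient of variation from the standard deviation and mean, and the variance from the standard deviation. A small sketch that recomputes them from the reported inputs (variable names are illustrative):

```python
import math

# Inputs copied from the che_pc_usd tables.
minimum, maximum = -1.0, 2.6569132
q1, q3 = 1.1799313, 1.8164794
mean, std = 1.5293892, 0.43909375

value_range = maximum - minimum  # reported: 3.6569132
iqr = q3 - q1                    # reported: 0.63654806
cv = std / mean                  # reported: 0.28710399
variance = std ** 2              # reported: 0.19280332

for name, got, reported in [
    ("range", value_range, 3.6569132),
    ("IQR", iqr, 0.63654806),
    ("CV", cv, 0.28710399),
    ("variance", variance, 0.19280332),
]:
    # The report rounds its inputs, so compare with a small tolerance.
    print(name, round(got, 6), math.isclose(got, reported, abs_tol=1e-6))
```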
| Value | Count | Frequency (%) |
| 1.680087391 | 1568 | 1.3% |
| 1.05696005 | 1172 | 1.0% |
| 1.104712859 | 1070 | 0.9% |
| 1.418695381 | 1006 | 0.8% |
| 1.448033708 | 961 | 0.8% |
| 1.423064919 | 957 | 0.8% |
| 1.07943196 | 929 | 0.8% |
| 1.678682896 | 881 | 0.7% |
| 2.503901373 | 871 | 0.7% |
| 1.660892634 | 855 | 0.7% |
| Other values (386) | 108647 |
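The overview for che_pc_usd reports a minimum of -1 and 741 negative values, while the report counts no missing cells. One possible reading, which the report itself does not state, is that -1 is a placeholder for missing data. Under that assumption, the mean of the remaining rows can be recovered from the reported sum and counts alone. The function name below is hypothetical.

```python
def mean_without_sentinel(total_sum, n_rows, sentinel, sentinel_count):
    """Mean of the rows that remain after dropping `sentinel_count` sentinel rows."""
    kept_rows = n_rows - sentinel_count
    if kept_rows <= 0:
        raise ValueError("no rows left after removing the sentinel")
    # Subtract the sentinel's contribution from the sum before averaging.
    return (total_sum - sentinel * sentinel_count) / kept_rows


# Sum, row count and negative count copied from the che_pc_usd tables.
# ASSUMPTION: all 741 negative values are the placeholder -1.
adjusted_mean = mean_without_sentinel(181870.38, 118917, -1.0, 741)
print(round(adjusted_mean, 4))
```

With these inputs the adjusted mean comes out at about 1.545, slightly above the reported 1.529, because the placeholder rows no longer drag the average down.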
Minimum 10 values
| Value | Count | Frequency (%) |
| -1 | 741 | 0.6% |
| 1 | 10 | < 0.1% |
| 1.00062422 | 12 | < 0.1% |
| 1.001560549 | 12 | < 0.1% |
| 1.001872659 | 18 | < 0.1% |
| 1.002340824 | 24 | < 0.1% |
| 1.002808989 | 14 | < 0.1% |
| 1.003277154 | 12 | < 0.1% |
| 1.003755484 | 12 | < 0.1% |
| 1.004057428 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2.656913233 | 676 | 0.6% |
| 2.604244694 | 804 | 0.7% |
| 2.535736579 | 850 | 0.7% |
| 2.503901373 | 871 | 0.7% |
| 2.494382022 | 690 | 0.6% |
| 2.490168539 | 175 | 0.1% |
| 2.468320849 | 548 | 0.5% |
| 2.459581773 | 368 | 0.3% |
| 2.442883895 | 20 | < 0.1% |
| 2.437421973 | 11 | < 0.1% |
che_perc_gdp
Real number (ℝ)
High correlation 
| Distinct | 409 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.6070998 |
| Minimum | -1 |
| Maximum | 2.3111028 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 3644 |
| Negative (%) | 3.1% |
| Memory size | 929.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1.0731578 |
| Q1 | 1.4648487 |
| median | 1.731474 |
| Q3 | 1.8941392 |
| 95-th percentile | 2.0517702 |
| Maximum | 2.3111028 |
| Range | 3.3111028 |
| Interquartile range (IQR) | 0.42929047 |
Descriptive statistics
| Standard deviation | 0.53799405 |
|---|---|
| Coefficient of variation (CV) | 0.33476082 |
| Kurtosis | 14.067911 |
| Mean | 1.6070998 |
| Median Absolute Deviation (MAD) | 0.19446208 |
| Skewness | -3.4326905 |
| Sum | 191111.49 |
| Variance | 0.28943759 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 3644 | 3.1% |
| 1.730520606 | 1006 | 0.8% |
| 1.941136092 | 961 | 0.8% |
| 1.753337696 | 957 | 0.8% |
| 1.977700795 | 881 | 0.7% |
| 1.877291415 | 871 | 0.7% |
| 2.035271828 | 863 | 0.7% |
| 1.659159354 | 855 | 0.7% |
| 2.054177364 | 851 | 0.7% |
| 2.037798443 | 851 | 0.7% |
| Other values (399) | 107177 |
Minimum 10 values
| Value | Count | Frequency (%) |
| -1 | 3644 | 3.1% |
| 1 | 10 | < 0.1% |
| 1.020151729 | 12 | < 0.1% |
| 1.036612732 | 12 | < 0.1% |
| 1.04042123 | 533 | 0.4% |
| 1.043470525 | 104 | 0.1% |
| 1.044247892 | 24 | < 0.1% |
| 1.046249573 | 59 | < 0.1% |
| 1.051142034 | 18 | < 0.1% |
| 1.051941331 | 12 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2.311102801 | 617 | 0.5% |
| 2.208068896 | 787 | 0.7% |
| 2.206787733 | 501 | 0.4% |
| 2.160170084 | 553 | 0.5% |
| 2.146527555 | 14 | < 0.1% |
| 2.126096163 | 263 | 0.2% |
| 2.103845172 | 96 | 0.1% |
| 2.102405175 | 731 | 0.6% |
| 2.085983813 | 118 | 0.1% |
| 2.085917456 | 99 | 0.1% |
cluster_nl
Text
| Distinct | 2716 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 929.2 KiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Unique
| Unique | 63 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | BRAND_354E_COUNTRY_88A3 |
|---|---|
| 2nd row | BRAND_626D_COUNTRY_8B47 |
| 3rd row | BRAND_45D9_COUNTRY_88A3 |
| 4th row | BRAND_D724_COUNTRY_445D |
| 5th row | BRAND_4887_COUNTRY_D8B0 |
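The sample rows suggest that cluster_nl is a brand identifier and a country identifier joined by an underscore: the first sample value, BRAND_354E_COUNTRY_88A3, combines the first sample values of the brand and country variables. Assuming that pattern holds for every row (the report does not confirm it), a value can be split back into its two parts as sketched here. The function name is hypothetical.

```python
from typing import Tuple


def split_cluster(cluster_nl: str) -> Tuple[str, str]:
    """Split 'BRAND_xxxx_COUNTRY_yyyy' into ('BRAND_xxxx', 'COUNTRY_yyyy')."""
    brand, separator, country_code = cluster_nl.partition("_COUNTRY_")
    if not separator or not brand.startswith("BRAND_") or not country_code:
        raise ValueError(f"unexpected cluster_nl format: {cluster_nl!r}")
    return brand, "COUNTRY_" + country_code


print(split_cluster("BRAND_354E_COUNTRY_88A3"))
# prints ('BRAND_354E', 'COUNTRY_88A3')
```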
| Value | Count | Frequency (%) |
| brand_354e_country_88a3 | 60 | 0.1% |
| brand_3ba7_country_445d | 60 | 0.1% |
| brand_c21a_country_4442 | 60 | 0.1% |
| brand_7a2e_country_d5b9 | 60 | 0.1% |
| brand_f886_country_d5b9 | 60 | 0.1% |
| brand_88b9_country_53a5 | 60 | 0.1% |
| brand_ccaa_country_c8f4 | 60 | 0.1% |
| brand_e551_country_8dbb | 60 | 0.1% |
| brand_061c_country_6f78 | 60 | 0.1% |
| brand_061c_country_1007 | 60 | 0.1% |
| Other values (2706) | 118317 |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 356751 | |
| N | 237834 | 8.7% |
| R | 237834 | 8.7% |
| A | 178401 | 6.5% |
| B | 178143 | 6.5% |
| D | 172986 | 6.3% |
| C | 163356 | 6.0% |
| Y | 118917 | 4.3% |
| T | 118917 | 4.3% |
| U | 118917 | 4.3% |
| Other values (13) | 853035 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2735091 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2735091 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2735091 |
corporation
Text
| Distinct | 136 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 929.2 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | CORP_D524 |
|---|---|
| 2nd row | CORP_01C7 |
| 3rd row | CORP_39F7 |
| 4th row | CORP_711A |
| 5th row | CORP_443D |
| Value | Count | Frequency (%) |
| corp_01c7 | 19445 | 16.4% |
| corp_5cbd | 9004 | 7.6% |
| corp_c868 | 7771 | 6.5% |
| corp_8f4f | 6502 | 5.5% |
| corp_a713 | 5879 | 4.9% |
| corp_443d | 5309 | 4.5% |
| corp_39f7 | 5149 | 4.3% |
| corp_a682 | 5070 | 4.3% |
| corp_09bb | 4726 | 4.0% |
| corp_a278 | 4035 | 3.4% |
| Other values (126) | 46027 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 162413 | |
| R | 118917 | |
| P | 118917 | |
| _ | 118917 | |
| O | 118917 | |
| 7 | 52531 | 4.9% |
| 8 | 44659 | 4.2% |
| 1 | 39553 | 3.7% |
| 0 | 34920 | 3.3% |
| B | 30737 | 2.9% |
| Other values (10) | 229772 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1070253 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1070253 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1070253 |
country
Categorical
High correlation 
| Distinct | 49 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 929.2 KiB |
| COUNTRY_907E | 5522 |
|---|---|
| COUNTRY_3AD0 | 5002 |
| COUNTRY_89F9 | 4811 |
| COUNTRY_53A5 | 4782 |
| COUNTRY_D8B0 | 4654 |
| Other values (44) | 94146 |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | COUNTRY_88A3 |
|---|---|
| 2nd row | COUNTRY_8B47 |
| 3rd row | COUNTRY_88A3 |
| 4th row | COUNTRY_445D |
| 5th row | COUNTRY_D8B0 |
Common Values
| Value | Count | Frequency (%) |
| COUNTRY_907E | 5522 | 4.6% |
| COUNTRY_3AD0 | 5002 | 4.2% |
| COUNTRY_89F9 | 4811 | 4.0% |
| COUNTRY_53A5 | 4782 | 4.0% |
| COUNTRY_D8B0 | 4654 | 3.9% |
| COUNTRY_445D | 4547 | 3.8% |
| COUNTRY_4242 | 4425 | 3.7% |
| COUNTRY_1007 | 4119 | 3.5% |
| COUNTRY_9891 | 3891 | 3.3% |
| COUNTRY_6F78 | 3810 | 3.2% |
| Other values (39) | 73354 |
Length
| Value | Count | Frequency (%) |
| country_907e | 5522 | 4.6% |
| country_3ad0 | 5002 | 4.2% |
| country_89f9 | 4811 | 4.0% |
| country_53a5 | 4782 | 4.0% |
| country_d8b0 | 4654 | 3.9% |
| country_445d | 4547 | 3.8% |
| country_4242 | 4425 | 3.7% |
| country_1007 | 4119 | 3.5% |
| country_9891 | 3891 | 3.3% |
| country_6f78 | 3810 | 3.2% |
| Other values (39) | 73354 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 132674 | 9.3% |
| O | 118917 | 8.3% |
| U | 118917 | 8.3% |
| N | 118917 | 8.3% |
| T | 118917 | 8.3% |
| R | 118917 | 8.3% |
| Y | 118917 | 8.3% |
| _ | 118917 | 8.3% |
| 4 | 48840 | 3.4% |
| 8 | 44576 | 3.1% |
| Other values (13) | 368495 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1427004 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1427004 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1427004 |
launch_date
Date
| Distinct | 103 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 929.2 KiB |
| Minimum | 2014-06-01 00:00:00 |
|---|---|
| Maximum | 2022-12-01 00:00:00 |
date
Date
| Distinct | 103 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 929.2 KiB |
| Minimum | 2014-06-01 00:00:00 |
|---|---|
| Maximum | 2022-12-01 00:00:00 |
drug_id
Text
| Distinct | 257 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 929.2 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | DRUG_ID_8795 |
|---|---|
| 2nd row | DRUG_ID_E66E |
| 3rd row | DRUG_ID_F272 |
| 4th row | DRUG_ID_1D4E |
| 5th row | DRUG_ID_AA88 |
| Value | Count | Frequency (%) |
| drug_id_3a6f | 3035 | 2.6% |
| drug_id_d637 | 2567 | 2.2% |
| drug_id_b15f | 1852 | 1.6% |
| drug_id_30f8 | 1841 | 1.5% |
| drug_id_b633 | 1828 | 1.5% |
| drug_id_3416 | 1702 | 1.4% |
| drug_id_be46 | 1676 | 1.4% |
| drug_id_eee7 | 1592 | 1.3% |
| drug_id_473b | 1544 | 1.3% |
| drug_id_b8a8 | 1521 | 1.3% |
| Other values (247) | 99759 |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 267269 | |
| _ | 237834 | |
| R | 118917 | |
| U | 118917 | |
| G | 118917 | |
| I | 118917 | |
| 7 | 44979 | 3.2% |
| 3 | 41096 | 2.9% |
| 8 | 35879 | 2.5% |
| B | 33678 | 2.4% |
| Other values (11) | 290601 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1427004 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1427004 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1427004 |
ind_launch_date
Text
| Distinct | 123 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 929.2 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 2 |
| Mean length | 6.4488172 |
| Min length | 2 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | -1 |
|---|---|
| 2nd row | 2014-09-01 00:00:00 |
| 3rd row | -1 |
| 4th row | -1 |
| 5th row | -1 |
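ind_launch_date is profiled as Text because it mixes the string -1 with timestamps such as 2014-09-01 00:00:00. If -1 is taken to mean "no indication launch date" (an assumption, since the report does not say so), the column could be parsed into optional datetimes as sketched below. The function name is hypothetical. The second part of the sketch checks that the reported mean length is consistent with the token counts in this variable's word-frequency table (87797 occurrences of 1 and 31120 of 00:00:00), read here as 87797 placeholder rows and 31120 timestamp rows.

```python
from datetime import datetime
from typing import Optional


def parse_ind_launch_date(raw: str) -> Optional[datetime]:
    """Return None for the '-1' placeholder, otherwise the parsed timestamp."""
    if raw.strip() == "-1":
        return None
    return datetime.strptime(raw, "%Y-%m-%d %H:%M:%S")


print(parse_ind_launch_date("-1"))                   # None
print(parse_ind_launch_date("2014-09-01 00:00:00"))  # 2014-09-01 00:00:00

# Consistency check on the reported mean length (6.4488172).
n_placeholder = 87797  # rows holding "-1" (2 characters), inferred from the word table
n_timestamp = 31120    # rows holding a 19-character timestamp
mean_length = (2 * n_placeholder + 19 * n_timestamp) / (n_placeholder + n_timestamp)
print(round(mean_length, 4))
```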
| Value | Count | Frequency (%) |
| 1 | 87797 | 58.5% |
| 00:00:00 | 31120 | 20.7% |
| 2019-11-01 | 2234 | 1.5% |
| 2020-03-01 | 1904 | 1.3% |
| 2019-08-01 | 1465 | 1.0% |
| 2020-01-01 | 1154 | 0.8% |
| 2019-07-01 | 967 | 0.6% |
| 2020-08-01 | 876 | 0.6% |
| 2020-02-01 | 860 | 0.6% |
| 2020-11-01 | 816 | 0.5% |
| Other values (114) | 20844 | 13.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 280359 | |
| 1 | 159617 | |
| - | 150037 | |
| : | 62240 | 8.1% |
| 2 | 50583 | 6.6% |
| (space) | 31120 | 4.1% |
| 9 | 9159 | 1.2% |
| 7 | 5597 | 0.7% |
| 8 | 5443 | 0.7% |
| 3 | 4295 | 0.6% |
| Other values (3) | 8424 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 766874 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 766874 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 766874 |
indication
Text
| Distinct | 257 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 929.2 KiB |
Length
| Max length | 168 |
|---|---|
| Median length | 12 |
| Mean length | 18.811373 |
| Min length | 12 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ['IND_C3B6'] |
|---|---|
| 2nd row | ['IND_1590', 'IND_ECAC'] |
| 3rd row | ['IND_B2EF'] |
| 4th row | ['IND_BAFB'] |
| 5th row | ['IND_3F31'] |
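The sample values show that each indication cell is a Python-style list literal stored as a string, for example ['IND_1590', 'IND_ECAC']. A minimal sketch of turning such a cell back into a list of codes with the standard library follows. The function name is hypothetical, and the sketch assumes every cell follows the format of the samples.

```python
import ast
from typing import List


def parse_indication(raw: str) -> List[str]:
    """Parse a stringified list such as "['IND_1590', 'IND_ECAC']"."""
    parsed = ast.literal_eval(raw)  # evaluates literals only, never arbitrary code
    if not isinstance(parsed, list) or not all(isinstance(c, str) for c in parsed):
        raise ValueError(f"unexpected indication format: {raw!r}")
    return parsed


print(parse_indication("['IND_C3B6']"))              # ['IND_C3B6']
print(parse_indication("['IND_1590', 'IND_ECAC']"))  # ['IND_1590', 'IND_ECAC']
```

If every code has eight characters like the samples, a list with n codes serializes to 12n characters. The reported minimum length of 12 then corresponds to one code, and the maximum length of 168 would correspond to 14 codes.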
| Value | Count | Frequency (%) |
| ind_3a0d | 20621 | 11.1% |
| ind_617c | 12986 | 7.0% |
| ind_b2ef | 11227 | 6.0% |
| ind_f338 | 7303 | 3.9% |
| ind_da0b | 7185 | 3.9% |
| ind_c3b6 | 6814 | 3.7% |
| ind_7c11 | 5678 | 3.0% |
| ind_bafb | 5611 | 3.0% |
| ind_bd8b | 5345 | 2.9% |
| ind_c829 | 4614 | 2.5% |
| Other values (145) | 99032 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 372832 | |
| D | 241283 | 10.8% |
| I | 186416 | 8.3% |
| N | 186416 | 8.3% |
| _ | 186416 | 8.3% |
| [ | 118917 | 5.3% |
| ] | 118917 | 5.3% |
| , | 67499 | 3.0% |
| (space) | 67499 | 3.0% |
| A | 61225 | 2.7% |
| Other values (14) | 629572 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2236992 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2236992 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2236992 |
insurance_perc_che
Real number (ℝ)
High correlation 
| Distinct | 81 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0089991 |
| Minimum | -1 |
| Maximum | 2.04 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 23190 |
| Negative (%) | 19.5% |
| Memory size | 929.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | 1 |
| median | 1.3466667 |
| Q3 | 1.76 |
| 95-th percentile | 2.0266667 |
| Maximum | 2.04 |
| Range | 3.04 |
| Interquartile range (IQR) | 0.76 |
Descriptive statistics
| Standard deviation | 1.0441538 |
|---|---|
| Coefficient of variation (CV) | 1.0348412 |
| Kurtosis | -0.13423794 |
| Mean | 1.0089991 |
| Median Absolute Deviation (MAD) | 0.34666667 |
| Skewness | -1.163099 |
| Sum | 119987.14 |
| Variance | 1.0902571 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 23190 | 19.5% |
| 1 | 13526 | 11.4% |
| 2 | 5827 | 4.9% |
| 1.04 | 5735 | 4.8% |
| 2.026666667 | 5345 | 4.5% |
| 1.573333333 | 4802 | 4.0% |
| 1.173333333 | 3846 | 3.2% |
| 1.346666667 | 3556 | 3.0% |
| 1.706666667 | 3385 | 2.8% |
| 1.013333333 | 3049 | 2.6% |
| Other values (71) | 46656 |
Minimum 10 values
| Value | Count | Frequency (%) |
| -1 | 23190 | 19.5% |
| 1 | 13526 | 11.4% |
| 1.013333333 | 3049 | 2.6% |
| 1.026666667 | 13 | < 0.1% |
| 1.04 | 5735 | 4.8% |
| 1.053333333 | 250 | 0.2% |
| 1.066666667 | 525 | 0.4% |
| 1.07445857 | 222 | 0.2% |
| 1.08 | 631 | 0.5% |
| 1.083333333 | 104 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2.04 | 678 | 0.6% |
| 2.032511106 | 538 | 0.5% |
| 2.026666667 | 5345 | 4.5% |
| 2.013333333 | 1559 | 1.3% |
| 2 | 5827 | 4.9% |
| 1.997189947 | 268 | 0.2% |
| 1.986666667 | 2756 | 2.3% |
| 1.973333333 | 1006 | 0.8% |
| 1.96 | 161 | 0.1% |
| 1.946666667 | 1577 | 1.3% |
population
Real number (ℝ)
High correlation 
| Distinct | 426 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4836802 |
| Minimum | 1 |
| Maximum | 12.767484 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 929.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1.0071334 |
| Q1 | 1.0382275 |
| median | 1.2388702 |
| Q3 | 1.5066849 |
| 95-th percentile | 2.1686553 |
| Maximum | 12.767484 |
| Range | 11.767484 |
| Interquartile range (IQR) | 0.46845747 |
Descriptive statistics
| Standard deviation | 1.3365421 |
|---|---|
| Coefficient of variation (CV) | 0.90082896 |
| Kurtosis | 59.920797 |
| Mean | 1.4836802 |
| Median Absolute Deviation (MAD) | 0.2063996 |
| Skewness | 7.5440015 |
| Sum | 176434.8 |
| Variance | 1.7863448 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.353334517 | 1006 | 0.8% |
| 1.355258721 | 961 | 0.8% |
| 1.350525336 | 957 | 0.8% |
| 2.004527671 | 881 | 0.7% |
| 1.03414855 | 871 | 0.7% |
| 2 | 863 | 0.7% |
| 1.515985518 | 855 | 0.7% |
| 1.649419475 | 851 | 0.7% |
| 1.651501074 | 851 | 0.7% |
| 1.033642284 | 850 | 0.7% |
| Other values (416) | 109971 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1 | 4 | < 0.1% |
| 1.00052611 | 21 | < 0.1% |
| 1.001143996 | 7 | < 0.1% |
| 1.00115855 | 56 | < 0.1% |
| 1.001419962 | 103 | 0.1% |
| 1.001780848 | 224 | 0.2% |
| 1.001800063 | 147 | 0.1% |
| 1.002195008 | 388 | 0.3% |
| 1.002447205 | 180 | 0.2% |
| 1.002659153 | 446 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 12.76748385 | 374 | 0.3% |
| 12.75950595 | 273 | 0.2% |
| 12.73412598 | 186 | 0.2% |
| 12.69443396 | 129 | 0.1% |
| 12.63819355 | 80 | 0.1% |
| 12.61574117 | 104 | 0.1% |
| 12.56876735 | 48 | < 0.1% |
| 12.52321419 | 72 | 0.1% |
| 12.50109656 | 48 | < 0.1% |
| 12.43051548 | 13 | < 0.1% |
prev_perc
Real number (ℝ)
High correlation 
| Distinct | 3602 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.057682063 |
| Minimum | 5.9541615 × 10⁻⁷ |
| Maximum | 0.66680351 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 929.2 KiB |
Quantile statistics
| Minimum | 5.9541615 × 10⁻⁷ |
|---|---|
| 5-th percentile | 0.00011181244 |
| Q1 | 0.0024639518 |
| median | 0.01879598 |
| Q3 | 0.085566763 |
| 95-th percentile | 0.21944991 |
| Maximum | 0.66680351 |
| Range | 0.66680292 |
| Interquartile range (IQR) | 0.083102812 |
Descriptive statistics
| Standard deviation | 0.091632685 |
|---|---|
| Coefficient of variation (CV) | 1.588582 |
| Kurtosis | 11.834285 |
| Mean | 0.057682063 |
| Median Absolute Deviation (MAD) | 0.018557644 |
| Skewness | 3.0975691 |
| Sum | 6859.3778 |
| Variance | 0.008396549 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.09559488354 | 801 | 0.7% |
| 0.09683559853 | 796 | 0.7% |
| 0.09432083564 | 727 | 0.6% |
| 4.676067381 × 10⁻⁵ | 660 | 0.6% |
| 0.09809551509 | 636 | 0.5% |
| 0.0929705782 | 619 | 0.5% |
| 0.00593172072 | 613 | 0.5% |
| 0.08484053454 | 609 | 0.5% |
| 0.08457552731 | 553 | 0.5% |
| 0.08507706745 | 551 | 0.5% |
| Other values (3592) | 112352 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 5.954161519 × 10⁻⁷ | 12 | < 0.1% |
| 6.021349343 × 10⁻⁷ | 12 | < 0.1% |
| 6.048398308 × 10⁻⁷ | 12 | < 0.1% |
| 6.12531721 × 10⁻⁷ | 9 | < 0.1% |
| 3.141321775 × 10⁻⁵ | 1 | < 0.1% |
| 3.205414826 × 10⁻⁵ | 12 | < 0.1% |
| 3.283758691 × 10⁻⁵ | 12 | < 0.1% |
| 3.520129062 × 10⁻⁵ | 24 | < 0.1% |
| 3.526905456 × 10⁻⁵ | 24 | < 0.1% |
| 3.528696692 × 10⁻⁵ | 34 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 0.6668035127 | 9 | < 0.1% |
| 0.6666800542 | 3 | < 0.1% |
| 0.6652176577 | 12 | < 0.1% |
| 0.6638663455 | 12 | < 0.1% |
| 0.6630121911 | 12 | < 0.1% |
| 0.6621942684 | 12 | < 0.1% |
| 0.6036428326 | 15 | < 0.1% |
| 0.6000781942 | 24 | < 0.1% |
| 0.5962034647 | 24 | < 0.1% |
| 0.5931450793 | 24 | < 0.1% |
price_month
Real number (ℝ)
High correlation 
| Distinct | 3602 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.91244169 |
| Minimum | -1 |
| Maximum | 39.343041 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 24152 |
| Negative (%) | 20.3% |
| Memory size | 929.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | 1.0028542 |
| median | 1.015832 |
| Q3 | 1.3165946 |
| 95-th percentile | 2.2257707 |
| Maximum | 39.343041 |
| Range | 40.343041 |
| Interquartile range (IQR) | 0.31374036 |
Descriptive statistics
| Standard deviation | 1.4145108 |
|---|---|
| Coefficient of variation (CV) | 1.5502479 |
| Kurtosis | 225.91224 |
| Mean | 0.91244169 |
| Median Absolute Deviation (MAD) | 0.20162333 |
| Skewness | 8.6879257 |
| Sum | 108504.83 |
| Variance | 2.0008409 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 24152 | 20.3% |
| 1.004638095 | 335 | 0.3% |
| 1.006020604 | 323 | 0.3% |
| 1.004771886 | 306 | 0.3% |
| 1.005262454 | 285 | 0.2% |
| 1.006555769 | 278 | 0.2% |
| 1.004370512 | 275 | 0.2% |
| 1.00561923 | 262 | 0.2% |
| 1.005663827 | 256 | 0.2% |
| 1.003121795 | 251 | 0.2% |
| Other values (3592) | 92194 |
Minimum 10 values
| Value | Count | Frequency (%) |
| -1 | 24152 | 20.3% |
| 1 | 30 | < 0.1% |
| 1.00010406 | 36 | < 0.1% |
| 1.000108307 | 7 | < 0.1% |
| 1.000121049 | 12 | < 0.1% |
| 1.000133791 | 20 | < 0.1% |
| 1.000152904 | 41 | < 0.1% |
| 1.00015609 | 4 | < 0.1% |
| 1.000170955 | 24 | < 0.1% |
| 1.000178388 | 9 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 39.34304063 | 49 | < 0.1% |
| 10.1026829 | 9 | < 0.1% |
| 9.233829847 | 60 | 0.1% |
| 9.073092097 | 13 | < 0.1% |
| 8.946663888 | 16 | < 0.1% |
| 8.704930949 | 18 | < 0.1% |
| 8.50771777 | 19 | < 0.1% |
| 8.179681577 | 6 | < 0.1% |
| 8.094738344 | 12 | < 0.1% |
| 8.048833787 | 24 | < 0.1% |
price_unit
Real number (ℝ)
High correlation  Skewed 
| Distinct | 114563 |
|---|---|
| Distinct (%) | 96.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4633047 |
| Minimum | -1 |
| Maximum | 535.92652 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 239 |
| Negative (%) | 0.2% |
| Memory size | 929.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1.0037258 |
| Q1 | 1.0118184 |
| median | 1.0854146 |
| Q3 | 1.4204449 |
| 95-th percentile | 2.4369209 |
| Maximum | 535.92652 |
| Range | 536.92652 |
| Interquartile range (IQR) | 0.40862642 |
Descriptive statistics
| Standard deviation | 5.4641702 |
|---|---|
| Coefficient of variation (CV) | 3.7341302 |
| Kurtosis | 8476.3857 |
| Mean | 1.4633047 |
| Median Absolute Deviation (MAD) | 0.081198512 |
| Skewness | 87.52047 |
| Sum | 174011.8 |
| Variance | 29.857156 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 239 | 0.2% |
| 1 | 196 | 0.2% |
| 1.002194421 | 60 | 0.1% |
| 1.006073823 | 54 | < 0.1% |
| 1.006870241 | 53 | < 0.1% |
| 1.130560009 | 53 | < 0.1% |
| 1.584741207 | 53 | < 0.1% |
| 1.009217574 | 52 | < 0.1% |
| 1.069418063 | 47 | < 0.1% |
| 1.009079098 | 47 | < 0.1% |
| Other values (114553) | 118063 |
Minimum 10 values
| Value | Count | Frequency (%) |
| -1 | 239 | 0.2% |
| 1 | 196 | 0.2% |
| 1.000000043 | 2 | < 0.1% |
| 1.000027661 | 1 | < 0.1% |
| 1.000082771 | 1 | < 0.1% |
| 1.000138396 | 1 | < 0.1% |
| 1.000143254 | 1 | < 0.1% |
| 1.000150809 | 1 | < 0.1% |
| 1.000152479 | 1 | < 0.1% |
| 1.000158363 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 535.9265168 | 11 | < 0.1% |
| 83.70638466 | 26 | < 0.1% |
| 27.83922423 | 1 | < 0.1% |
| 27.80384683 | 1 | < 0.1% |
| 27.68080382 | 1 | < 0.1% |
| 27.48212787 | 1 | < 0.1% |
| 26.89041286 | 1 | < 0.1% |
| 26.71832755 | 1 | < 0.1% |
| 26.69198797 | 1 | < 0.1% |
| 26.6798975 | 1 | < 0.1% |
public_perc_che
Real number (ℝ)
High correlation 
| Distinct | 92 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7620059 |
| Minimum | -1 |
| Maximum | 2.0447761 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 741 |
| Negative (%) | 0.6% |
| Memory size | 929.2 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1.238806 |
| Q1 | 1.6716418 |
| median | 1.8358209 |
| Q3 | 1.9253731 |
| 95-th percentile | 2.0149254 |
| Maximum | 2.0447761 |
| Range | 3.0447761 |
| Interquartile range (IQR) | 0.25373134 |
Descriptive statistics
| Standard deviation | 0.30340097 |
|---|---|
| Coefficient of variation (CV) | 0.17219067 |
| Kurtosis | 40.601716 |
| Mean | 1.7620059 |
| Median Absolute Deviation (MAD) | 0.10447761 |
| Skewness | -5.0193021 |
| Sum | 209532.46 |
| Variance | 0.092052151 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 9724 | 8.2% |
| 1.805970149 | 6600 | 5.6% |
| 1.940298507 | 5430 | 4.6% |
| 1.895522388 | 5341 | 4.5% |
| 1.791044776 | 5174 | 4.4% |
| 1.925373134 | 5104 | 4.3% |
| 1.850746269 | 5073 | 4.3% |
| 1.865671642 | 4811 | 4.0% |
| 1.910447761 | 4600 | 3.9% |
| 2.014925373 | 4335 | 3.6% |
| Other values (82) | 62725 | 52.7% |
| Value | Count | Frequency (%) |
| -1 | 741 | 0.6% |
| 1 | 12 | < 0.1% |
| 1.014925373 | 12 | < 0.1% |
| 1.02104729 | 12 | < 0.1% |
| 1.029850746 | 10 | < 0.1% |
| 1.044776119 | 26 | < 0.1% |
| 1.059701493 | 42 | < 0.1% |
| 1.089552239 | 1 | < 0.1% |
| 1.104477612 | 12 | < 0.1% |
| 1.134328358 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 2.044776119 | 840 | 0.7% |
| 2.029850746 | 3070 | 2.6% |
| 2.014925373 | 4335 | 3.6% |
| 2 | 9724 | 8.2% |
| 1.985846555 | 600 | 0.5% |
| 1.985074627 | 100 | 0.1% |
| 1.970149254 | 552 | 0.5% |
| 1.955223881 | 1604 | 1.3% |
| 1.940298507 | 5430 | 4.6% |
| 1.940104356 | 268 | 0.2% |
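The tables for public_perc_che suggest that -1 is a placeholder rather than a measured share: the report lists 0 missing cells, the count of negative values (741) equals the count of -1, and every other value is at least 1. Under that assumption, the sketch below shows how excluding the placeholder changes a simple mean. It uses the public_perc_che values from the first ten rows of the Sample table; `SENTINEL` and the variable names are illustrative, and the interpretation of -1 is not confirmed by the report.

```python
from statistics import mean

# Assumption: -1 encodes "not available" rather than a real value.
SENTINEL = -1.0

# public_perc_che values from the first ten rows of the Sample table.
values = [1.835821, -1.0, 1.835821, 1.805970, 1.880597,
          1.791045, 1.820896, 1.985075, 1.238806, 1.940299]

# Keep only values that are not the placeholder.
observed = [v for v in values if v != SENTINEL]
n_masked = len(values) - len(observed)

print(f"placeholder rows: {n_masked}")
print(f"mean with placeholder:    {mean(values):.6f}")
print(f"mean without placeholder: {mean(observed):.6f}")
```

If the placeholder interpretation holds, statistics such as the minimum, range, skewness and kurtosis reported above are also affected by the -1 entries and would need to be recomputed on the filtered values.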
therapeutic_area
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 929.2 KiB |
| Value | Count |
|---|---|
| THER_AREA_96D7 | 45858 |
| THER_AREA_66C5 | 22024 |
| THER_AREA_980E | 20298 |
| THER_AREA_6CEE | 11871 |
| THER_AREA_644A | 7579 |
| Other values (7) | 11287 |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | THER_AREA_980E |
|---|---|
| 2nd row | THER_AREA_96D7 |
| 3rd row | THER_AREA_96D7 |
| 4th row | THER_AREA_6CEE |
| 5th row | THER_AREA_6CEE |
Common Values
| Value | Count | Frequency (%) |
| THER_AREA_96D7 | 45858 | 38.6% |
| THER_AREA_66C5 | 22024 | 18.5% |
| THER_AREA_980E | 20298 | 17.1% |
| THER_AREA_6CEE | 11871 | 10.0% |
| THER_AREA_644A | 7579 | 6.4% |
| THER_AREA_CD59 | 4578 | 3.8% |
| THER_AREA_032C | 2011 | 1.7% |
| THER_AREA_4BA5 | 1628 | 1.4% |
| THER_AREA_8E53 | 1523 | 1.3% |
| THER_AREA_051D | 846 | 0.7% |
| Other values (2) | 701 | 0.6% |
Length
| Value | Count | Frequency (%) |
| ther_area_96d7 | 45858 | 38.6% |
| ther_area_66c5 | 22024 | 18.5% |
| ther_area_980e | 20298 | 17.1% |
| ther_area_6cee | 11871 | 10.0% |
| ther_area_644a | 7579 | 6.4% |
| ther_area_cd59 | 4578 | 3.8% |
| ther_area_032c | 2011 | 1.7% |
| ther_area_4ba5 | 1628 | 1.4% |
| ther_area_8e53 | 1523 | 1.3% |
| ther_area_051d | 846 | 0.7% |
| Other values (2) | 701 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 283509 | 17.0% |
| A | 247041 | 14.8% |
| R | 237834 | 14.3% |
| _ | 237834 | 14.3% |
| T | 118917 | 7.1% |
| H | 118917 | 7.1% |
| 6 | 109945 | 6.6% |
| 9 | 70734 | 4.2% |
| D | 51394 | 3.1% |
| 7 | 45858 | 2.8% |
| Other values (10) | 142855 | 8.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1664838 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1664838 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1664838 | 100.0% |
target
Real number (ℝ)
| Distinct | 112515 |
|---|---|
| Distinct (%) | 94.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.420171 |

| Minimum | 1 |
|---|---|
| Maximum | 28.576068 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 929.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1.0025139 |
| Q1 | 1.0222512 |
| median | 1.0890622 |
| Q3 | 1.3092273 |
| 95-th percentile | 2.8037516 |
| Maximum | 28.576068 |
| Range | 27.576068 |
| Interquartile range (IQR) | 0.28697616 |
Descriptive statistics
| Standard deviation | 1.1833305 |
|---|---|
| Coefficient of variation (CV) | 0.83323098 |
| Kurtosis | 97.597582 |
| Mean | 1.420171 |
| Median Absolute Deviation (MAD) | 0.080570636 |
| Skewness | 8.1239614 |
| Sum | 168882.48 |
| Variance | 1.4002711 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 335 | 0.3% |
| 1.422550727 | 6 | < 0.1% |
| 1.006525941 | 5 | < 0.1% |
| 1.000006181 | 5 | < 0.1% |
| 1.002756112 | 5 | < 0.1% |
| 1.003030932 | 5 | < 0.1% |
| 1.003022759 | 5 | < 0.1% |
| 1.003849005 | 5 | < 0.1% |
| 1.000473016 | 5 | < 0.1% |
| 1.001559779 | 5 | < 0.1% |
| Other values (112505) | 118536 | 99.7% |
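The Frequency (%) column appears to be the count divided by the number of observations (118917), rounded to one decimal place, with "< 0.1%" shown when a non-zero share rounds below 0.1. The sketch below reproduces that convention for the target counts above. The function name `frequency_label` and the exact rounding rule are assumptions inferred from the displayed values, not taken from the profiling tool's source.

```python
N_OBSERVATIONS = 118917  # "Number of observations" from the report overview


def frequency_label(count: int, total: int = N_OBSERVATIONS) -> str:
    """Format a count as a percentage label in the style used by the tables."""
    rounded = round(100.0 * count / total, 1)
    # Assumed convention: non-zero shares that round below 0.1 are shown as "< 0.1%".
    if count > 0 and rounded < 0.1:
        return "< 0.1%"
    return f"{rounded:.1f}%"


# Counts taken from the common-values table for target.
for value, count in [("1", 335), ("1.422550727", 6), ("Other values", 118536)]:
    print(f"{value}: {count} -> {frequency_label(count)}")
```

The same rule matches the labels shown for the other variables in this report, for example 9724 occurrences mapping to 8.2%.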
| Value | Count | Frequency (%) |
| 1 | 335 | 0.3% |
| 1.000000008 | 1 | < 0.1% |
| 1.000001788 | 1 | < 0.1% |
| 1.000002796 | 1 | < 0.1% |
| 1.000004597 | 1 | < 0.1% |
| 1.000005248 | 1 | < 0.1% |
| 1.000005676 | 1 | < 0.1% |
| 1.000006181 | 5 | < 0.1% |
| 1.000006896 | 1 | < 0.1% |
| 1.000007662 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 28.57606791 | 1 | < 0.1% |
| 27.52764155 | 1 | < 0.1% |
| 27.29199202 | 1 | < 0.1% |
| 27.08169075 | 1 | < 0.1% |
| 26.8762259 | 1 | < 0.1% |
| 26.84520067 | 1 | < 0.1% |
| 26.54169678 | 1 | < 0.1% |
| 26.07133788 | 1 | < 0.1% |
| 25.77912202 | 1 | < 0.1% |
| 24.75160313 | 1 | < 0.1% |
Interactions
Correlations
| | che_pc_usd | che_perc_gdp | country | insurance_perc_che | population | prev_perc | price_month | price_unit | public_perc_che | target | therapeutic_area |
|---|---|---|---|---|---|---|---|---|---|---|---|
| che_pc_usd | 1.000 | 0.776 | 0.921 | -0.169 | -0.430 | 0.042 | 0.263 | 0.223 | 0.398 | 0.177 | 0.086 |
| che_perc_gdp | 0.776 | 1.000 | 0.838 | 0.033 | -0.042 | 0.002 | 0.216 | 0.182 | 0.285 | 0.335 | 0.105 |
| country | 0.921 | 0.838 | 1.000 | 0.962 | 1.000 | 0.230 | 0.119 | 0.059 | 0.973 | 0.107 | 0.148 |
| insurance_perc_che | -0.169 | 0.033 | 0.962 | 1.000 | 0.191 | 0.022 | -0.029 | 0.018 | -0.019 | 0.097 | 0.078 |
| population | -0.430 | -0.042 | 1.000 | 0.191 | 1.000 | -0.081 | -0.006 | -0.093 | -0.158 | 0.360 | 0.070 |
| prev_perc | 0.042 | 0.002 | 0.230 | 0.022 | -0.081 | 1.000 | -0.423 | -0.619 | -0.003 | -0.109 | 0.373 |
| price_month | 0.263 | 0.216 | 0.119 | -0.029 | -0.006 | -0.423 | 1.000 | 0.557 | 0.262 | 0.340 | 0.108 |
| price_unit | 0.223 | 0.182 | 0.059 | 0.018 | -0.093 | -0.619 | 0.557 | 1.000 | 0.146 | 0.265 | 0.022 |
| public_perc_che | 0.398 | 0.285 | 0.973 | -0.019 | -0.158 | -0.003 | 0.262 | 0.146 | 1.000 | 0.148 | 0.084 |
| target | 0.177 | 0.335 | 0.107 | 0.097 | 0.360 | -0.109 | 0.340 | 0.265 | 0.148 | 1.000 | 0.049 |
| therapeutic_area | 0.086 | 0.105 | 0.148 | 0.078 | 0.070 | 0.373 | 0.108 | 0.022 | 0.084 | 0.049 | 1.000 |
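The "High correlation" alerts in the overview can be related to this matrix by listing every pair whose absolute coefficient reaches a cutoff. The sketch below copies the matrix and uses a cutoff of 0.5, which is an assumption: it happens to reproduce the pairs named in the alerts, but the report itself does not state the threshold. The names `THRESHOLD` and `high_pairs` are illustrative.

```python
names = ["che_pc_usd", "che_perc_gdp", "country", "insurance_perc_che",
         "population", "prev_perc", "price_month", "price_unit",
         "public_perc_che", "target", "therapeutic_area"]

# Coefficients copied row by row from the correlation table above.
matrix = [
    [1.000, 0.776, 0.921, -0.169, -0.430, 0.042, 0.263, 0.223, 0.398, 0.177, 0.086],
    [0.776, 1.000, 0.838, 0.033, -0.042, 0.002, 0.216, 0.182, 0.285, 0.335, 0.105],
    [0.921, 0.838, 1.000, 0.962, 1.000, 0.230, 0.119, 0.059, 0.973, 0.107, 0.148],
    [-0.169, 0.033, 0.962, 1.000, 0.191, 0.022, -0.029, 0.018, -0.019, 0.097, 0.078],
    [-0.430, -0.042, 1.000, 0.191, 1.000, -0.081, -0.006, -0.093, -0.158, 0.360, 0.070],
    [0.042, 0.002, 0.230, 0.022, -0.081, 1.000, -0.423, -0.619, -0.003, -0.109, 0.373],
    [0.263, 0.216, 0.119, -0.029, -0.006, -0.423, 1.000, 0.557, 0.262, 0.340, 0.108],
    [0.223, 0.182, 0.059, 0.018, -0.093, -0.619, 0.557, 1.000, 0.146, 0.265, 0.022],
    [0.398, 0.285, 0.973, -0.019, -0.158, -0.003, 0.262, 0.146, 1.000, 0.148, 0.084],
    [0.177, 0.335, 0.107, 0.097, 0.360, -0.109, 0.340, 0.265, 0.148, 1.000, 0.049],
    [0.086, 0.105, 0.148, 0.078, 0.070, 0.373, 0.108, 0.022, 0.084, 0.049, 1.000],
]

THRESHOLD = 0.5  # assumption: cutoff inferred from the alerts, not stated in the report


def high_pairs(names, matrix, threshold):
    """Return (a, b, coefficient) for each unordered pair at or above the cutoff."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):  # upper triangle only, skip the diagonal
            if abs(matrix[i][j]) >= threshold:
                pairs.append((names[i], names[j], matrix[i][j]))
    return pairs


pairs = high_pairs(names, matrix, THRESHOLD)
for a, b, r in pairs:
    print(f"{a} ~ {b}: {r:+.3f}")
```

Most of the flagged pairs involve country, a categorical identifier, so those coefficients likely reflect that country-level indicators are nearly constant within each country rather than a relationship between two independent measurements.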
Missing values
Sample
| | brand | che_pc_usd | che_perc_gdp | cluster_nl | corporation | country | launch_date | date | drug_id | ind_launch_date | indication | insurance_perc_che | population | prev_perc | price_month | price_unit | public_perc_che | therapeutic_area | target |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | BRAND_354E | 1.209114 | 1.665879 | BRAND_354E_COUNTRY_88A3 | CORP_D524 | COUNTRY_88A3 | 2014-06-01 | 2014-06-01 | DRUG_ID_8795 | -1 | ['IND_C3B6'] | 1.893333 | 1.008039 | 0.028367 | 1.006444 | 1.013784 | 1.835821 | THER_AREA_980E | 1.000784 |
| 1 | BRAND_626D | -1.000000 | -1.000000 | BRAND_626D_COUNTRY_8B47 | CORP_01C7 | COUNTRY_8B47 | 2014-06-01 | 2014-06-01 | DRUG_ID_E66E | 2014-09-01 00:00:00 | ['IND_1590', 'IND_ECAC'] | -1.000000 | 1.023562 | 0.000047 | -1.000000 | 1.626677 | -1.000000 | THER_AREA_96D7 | 1.000000 |
| 2 | BRAND_45D9 | 1.209114 | 1.665879 | BRAND_45D9_COUNTRY_88A3 | CORP_39F7 | COUNTRY_88A3 | 2014-06-01 | 2014-06-01 | DRUG_ID_F272 | -1 | ['IND_B2EF'] | 1.893333 | 1.008039 | 0.001502 | -1.000000 | 3.144874 | 1.835821 | THER_AREA_96D7 | 1.002258 |
| 3 | BRAND_D724 | 1.851280 | 2.051770 | BRAND_D724_COUNTRY_445D | CORP_711A | COUNTRY_445D | 2014-06-01 | 2014-06-01 | DRUG_ID_1D4E | -1 | ['IND_BAFB'] | 1.000000 | 1.253186 | 0.001304 | -1.000000 | 1.213446 | 1.805970 | THER_AREA_6CEE | 1.068761 |
| 4 | BRAND_4887 | 1.791199 | 2.059130 | BRAND_4887_COUNTRY_D8B0 | CORP_443D | COUNTRY_D8B0 | 2014-06-01 | 2014-06-01 | DRUG_ID_AA88 | -1 | ['IND_3F31'] | 2.013333 | 1.639352 | 0.054467 | 1.018589 | 1.008708 | 1.880597 | THER_AREA_6CEE | 1.036312 |
| 5 | BRAND_6E6E | 1.132335 | 1.514478 | BRAND_6E6E_COUNTRY_9488 | CORP_711A | COUNTRY_9488 | 2014-06-01 | 2014-06-01 | DRUG_ID_0383 | -1 | ['IND_BAFB'] | 1.800000 | 1.279048 | 0.002884 | -1.000000 | 1.111391 | 1.791045 | THER_AREA_6CEE | 1.000821 |
| 6 | BRAND_03C2 | 1.812266 | 1.953901 | BRAND_03C2_COUNTRY_9891 | CORP_B65D | COUNTRY_9891 | 2014-06-01 | 2014-06-01 | DRUG_ID_E0F1 | -1 | ['IND_01DA'] | 1.573333 | 1.033353 | 0.349508 | 1.000401 | 1.001102 | 1.820896 | THER_AREA_644A | 1.000280 |
| 7 | BRAND_626D | 1.237984 | -1.000000 | BRAND_626D_COUNTRY_5180 | CORP_01C7 | COUNTRY_5180 | 2014-06-01 | 2014-06-01 | DRUG_ID_E66E | -1 | ['IND_1590', 'IND_ECAC', 'IND_D925'] | 1.933333 | 1.050796 | 0.000051 | -1.000000 | 2.139654 | 1.985075 | THER_AREA_96D7 | 1.002721 |
| 8 | BRAND_F05A | 2.442884 | 1.892659 | BRAND_F05A_COUNTRY_3AD0 | CORP_7E54 | COUNTRY_3AD0 | 2014-06-01 | 2014-06-01 | DRUG_ID_47B7 | -1 | ['IND_4000', 'IND_3A0D'] | 1.573333 | 1.030115 | 0.095441 | 1.003679 | 1.025722 | 1.238806 | THER_AREA_66C5 | 1.030898 |
| 9 | BRAND_CCAA | 1.669007 | 1.764281 | BRAND_CCAA_COUNTRY_89F9 | CORP_8F4F | COUNTRY_89F9 | 2014-06-01 | 2014-06-01 | DRUG_ID_07FE | -1 | ['IND_F338', 'IND_36A0'] | -1.000000 | 1.495485 | 0.012499 | 1.658609 | 1.790405 | 1.940299 | THER_AREA_96D7 | 1.111504 |
| | brand | che_pc_usd | che_perc_gdp | cluster_nl | corporation | country | launch_date | date | drug_id | ind_launch_date | indication | insurance_perc_che | population | prev_perc | price_month | price_unit | public_perc_che | therapeutic_area | target |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 118907 | BRAND_F148 | -1.000000 | -1.000000 | BRAND_F148_COUNTRY_8B47 | CORP_01C7 | COUNTRY_8B47 | 2019-03-01 | 2022-12-01 | DRUG_ID_BE95 | 2020-03-01 00:00:00 | ['IND_F338'] | -1.000000 | 1.025199 | 0.003552 | -1.000000 | 1.746809 | -1.000000 | THER_AREA_96D7 | 1.295619 |
| 118908 | BRAND_BDB6 | 1.183443 | 1.529843 | BRAND_BDB6_COUNTRY_FA79 | CORP_A682 | COUNTRY_FA79 | 2021-07-01 | 2022-12-01 | DRUG_ID_8AB0 | -1 | ['IND_DA0B', 'IND_BD8B'] | 1.785806 | 1.042912 | 0.033305 | 1.126477 | 1.403295 | 1.794286 | THER_AREA_032C | 1.035240 |
| 118909 | BRAND_76D6 | 1.340020 | -1.000000 | BRAND_76D6_COUNTRY_5180 | CORP_A682 | COUNTRY_5180 | 2018-05-01 | 2022-12-01 | DRUG_ID_45AB | -1 | ['IND_8EA5'] | 1.920000 | 1.051939 | 0.018808 | 2.392798 | 1.451953 | 2.044776 | THER_AREA_96D7 | 1.632343 |
| 118910 | BRAND_50D8 | 1.270827 | 1.684065 | BRAND_50D8_COUNTRY_6B71 | CORP_B3B2 | COUNTRY_6B71 | 2018-01-01 | 2022-12-01 | DRUG_ID_D235 | -1 | ['IND_7671'] | 1.440000 | 1.049628 | 0.020591 | -1.000000 | 1.001084 | 1.552239 | THER_AREA_CD59 | 1.006651 |
| 118911 | BRAND_280C | 1.462828 | 1.918488 | BRAND_280C_COUNTRY_907E | CORP_7883 | COUNTRY_907E | 2020-10-01 | 2022-12-01 | DRUG_ID_EC56 | 2020-02-01 00:00:00 | ['IND_3A0D'] | 1.040000 | 1.356278 | 0.092942 | 1.005084 | 1.008337 | 1.843706 | THER_AREA_66C5 | 1.005538 |
| 118912 | BRAND_2058 | 2.074438 | 2.058055 | BRAND_2058_COUNTRY_C8F4 | CORP_3C9A | COUNTRY_C8F4 | 2020-11-01 | 2022-12-01 | DRUG_ID_74A6 | -1 | ['IND_A496'] | -1.000000 | 1.049808 | 0.095233 | 1.054007 | 1.100336 | 2.029851 | THER_AREA_6CEE | 1.203657 |
| 118913 | BRAND_4888 | 1.756234 | 1.819485 | BRAND_4888_COUNTRY_6F78 | CORP_A713 | COUNTRY_6F78 | 2019-07-01 | 2022-12-01 | DRUG_ID_52A5 | -1 | ['IND_617C'] | 1.173333 | 1.008985 | 0.033939 | 1.008317 | 1.029630 | 1.955224 | THER_AREA_980E | 1.109272 |
| 118914 | BRAND_0056 | 1.127497 | 1.491552 | BRAND_0056_COUNTRY_0C7D | CORP_01C7 | COUNTRY_0C7D | 2018-09-01 | 2022-12-01 | DRUG_ID_D637 | -1 | ['IND_FC21'] | 1.826667 | 1.121505 | 0.012526 | 1.017259 | 1.018310 | 1.926795 | THER_AREA_644A | 1.343341 |
| 118915 | BRAND_6200 | 1.874532 | 2.020277 | BRAND_6200_COUNTRY_89F9 | CORP_39F7 | COUNTRY_89F9 | 2020-09-01 | 2022-12-01 | DRUG_ID_B0E9 | 2020-11-01 00:00:00 | ['IND_B2EF'] | -1.000000 | 1.520144 | 0.001334 | 1.960978 | 2.490911 | 1.985847 | THER_AREA_96D7 | 1.266831 |
| 118916 | BRAND_C21A | 1.127497 | 1.491552 | BRAND_C21A_COUNTRY_0C7D | CORP_C3C7 | COUNTRY_0C7D | 2021-05-01 | 2022-12-01 | DRUG_ID_E2D9 | -1 | ['IND_120F', 'IND_8E8D'] | 1.826667 | 1.121505 | 0.041152 | -1.000000 | 1.002689 | 1.926795 | THER_AREA_CD59 | 1.001763 |